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Maximum flow is approximable by deterministic 
constant-time algorithm in sparse networks 
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Abstract 

We show a deterministic constant-time parallel algorithm for finding an 
almost maximum flow in multisource-multitarget networks with bounded 
degrees and bounded edge capacities. As a consequence, we show that 
the value of the maximum flow over the number of nodes is a testable 
parameter on these networks. 



1 Introduction 



^ ' In the last decade it became apparent that a large number of the most interesting 

C^ , structures and phenomena of the world can be described by networks which are 

so large that the data about them can be collected only by indirect means like 
random local sampling. This yielded the motivation of property testing and 
parameter testing, which became an intensively studied field recently (see [3], 
p2] and all references). There are two special cases that have been treated in 



in 

^-^ • most papers. One is the dense graphs, where a positive fraction of all pairs of 

nodes are connected. The other is the sparse graphs, these are mostly graphs 
with bounded degrees. In this paper, we deal only with parameter testing of 
bounded-degree graphs. A parameter tester of bounded-degree graphs means 

^^ ■ an algorithm which chooses a constant numb er of random nodes, and to these 

constant radius neighbourhoods, assigns an estimation (a number) which is at 
most e far from the true parameter with at least 1 — e probability. We call a 
parameter testable if there exists a tester with arbitrary small errors. 

There is a strongly connected concept called constant-time algorithm, intro- 
duced by Nguyen and Onak[T4]. For example, consider the maximum matching 
problem. Here, a constant-time algorithm is a "local function" that decides 
about each edge that whether it chooses to the mathing or not, so that the 
chosen edges form a matching, and its size is at most en less than the size of 
the maximum matching, with at least 1 — e probability. Locality means that it 
depends only on the constant-size neighbourhood of the edge, including some 
random numbers, as follows. Randomization is strictly required to break sym- 
metry, for example in a regular large-girth graph. The idea of Nguyen and Onak 
was to assign independent random numbers to the nodes uniformly from [0, 1], 



and the neighbourhood consists not only of the induced subgraph of nodes at 
most constant far from the chosen node, but also of the random numbers of 
these nodes. In that paper, they showed a constant-time algorithm for this and 
some other problems. 

About the connection of the two concepts, notice that if we have a constant- 
time algorithm producing the maximum matching then the ratio of the size 
of the maximum matching and the number of nodes is a testable parameter. 
Because we can make a tester which simply calculates the probability that 
the chosen random node would be covered by the matching produced by the 
constant-time algorithm, and the average of these probabilities is a good ap- 
proximation. 

In this paper, we show a constant-time deterministic algorithm for a version 
of the maximum flow problem. This determinism means that we will get the 
analogous result using no random numbers. 

If we delete all edges from all sources or targets then the value of the max- 
imum flow decreases to while the distribution of the local neighbourhoods 
remains asymptotically the same. That is why the value of the maximum flow 
in a graph with 1, or even with o{n) sources or targets cannot be tested in any 
reasonable way. Similarly, one new edge with high capacity between a source 
and a target would increase the value of the maximum flow by this arbitrary 
large value. These are some reasons why we will deal only with multiple sources 
and targets and bounded capacities. 

2 Model and results 

There is an input graph G with degrees bounded by d. Its vertices are separated 
into the disjoint union of the colour sets R (regular), S (source) and T (target). 
Each edge of the graph is considered as two directed edge in the two directions, 
and we have a capacity function c : E{G) -^ [0, M] of the directed edges. The 
word " graph" will include these two structures and we handle d and M as global 
constants throughout the paper. Let 11^(0)1 = n, denote by out{v) the set of 
edges starting from a node v, and for an e e E{G), let — e denote the edge e in 
the opposite direction. We define the flow as a function / : E{G) -^ M. for which 
Ve e E{G) satisfies /(-e) = -/(e) and /(e) < c(e) andWv e S : ^ /(e) > 

eGout{v) 

and Vw e T : ^ /(e) < and Vw G i? : ^ /(e) = 0. The value of a 

e^out{v) e(£out{v) 

flow / is I/I = 2_. /(e)- Denote a maximum flow by /* — f*{G). 

sGS; e^out{s) 

The rooted neighbourhood of a vertex v or edge e of radius r, denoted by 
hr{v) and /ir(e), means the induced subgraph (with colours and edge weights) 
of the vertices at a distance at most r from v or e, with a mark at v or e. The 
set of all possible neighbourhoods are denoted by Hr ' and Hr ■ A local (flow) 
assignment means a function L : Hr — > R for which the function e — > L{hr{e)) 



is a flow for each graph. 

Theorem 1. For each e > there exists a constant-time algorithm producing 
a flow with value > \f*\ ^ en. 

We emphasize that Theorem [1] is about not a stochastic but a deterministic 
assignment. So from this aspect, Theorem[T]is stronger than necessary to prove 
Theorem[5] We get this determinism only by averaging on all random labellings, 
since the space of flows is a convex set and the values of flows is a linear function. 

Theorem 2. |/*(G)| is testable, namely, for all e > there exist fc, r G N and 
a function g : {Hi- )^ -^ R satisfying that if the vertices t'i,t'2, ... Vk are chosen 
independently with uniform distribution 
then E{\\f*{G)\/n - g{hr{vi),hr{v2), ... /irK))|) < £• 

We note that for all e > 0, having an approximation with expectedly < en 
error is stronger but offer equivalent to having this with < en error with at least 
1 — e probability. 

3 Proofs 

First we prove Theorem [T] using the following lemmas, and Theorem [5] will be 
an easy consequence of it. 

An augmenting path of a flow / is a directed path u ~ (ei, 62, ... e^) from 
S to T with /(ei) < c(ei) for each edge e^. The capacity of u means cap{u) = 
cap{u, f) = min(c(ei) — f{&i))^ and we identify an augmenting path u with the 

i 

flow u : E{G) — s> R, u{e) = {1 if 3i : e = e^; — 1 if 3i : e = — e^; otherwise}, 
which we also call path-flow. Augmenting on such a path u means the incre- 
mentation of / by cap{u) ■ u. 



Lemma 3. If for a flow f , there is no augmenting path with length at most I, 

-m 
I 



then I/I > |/*|- ^n. 



Proof. With the identically 2Af capacity function, /* — / is a flow, so we can 
split it into the sum of path- flows wi,M2, •■• Uq and a circulation wq that follow 
the directions of the flow /* — /, namely, \/i G {0, ... g}, e G E{G) : sgn{ui{e)) G 
{0,sgn((/*-/)(e))}. Thus, 

i/*i-i/i-i/*-/i = Ek^i-^E2zki 
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*=1 eeB(G) eeE{G) »=1 eeE{G) 

Lemma 4. // for a flow f , there is no augmenting path shorter than k, then 
augmenting on a path with length k does not create a new augmenting path with 
length at most k. 



Proof. Let the residual graph mean the graph without edge capacities Gf = 
{SiG),RiG),TiG), {e G E{G)\f{e) < c(e)}). Then the augmenting paths of 
G can be identified with the paths in Gf from S to T. So if the length of the 
shortest augmenting path of a flow is k then it means that the length of the 
shortest path in Gf from 5* to T is fc. Let the movement of an edge in G mean 
the difference of the distances of its endpoint and starting point from S in Gf. 
Augmenting on a shortest path adds only such edges to the residual graph on 
which / decreases, which are the reverse edges of the path. All these edges 
have movement —1 (calculated before augmenting). So if a path becomes an 
augmenting path at this augmenting step then all its edges have movements at 
most 1 and contain an edge with movement —1, so its length is at least fc + 2. D 

Let us label all paths u with length at most I with independent random 
variables x{u) chosen uniformly from [0,1), forbidding any two labels to be 
equal. We define chain as a sequence ui,U2,... Ug of such paths for which 
|mi| + x{ui) > \u2\ + x{u2) > ... \us\ + x{us) and \/i G {1, 2, ... s — 1} there 
exists a common undirected edge of Ui and Ui+i (henceforth: these intersect 
each other). 

Lemma 5. For each I G N and e > there exists a q = q{l,s) G N for which 
for every graph G (with degrees bounded by d) and its undirected edge e, with 
random labelling, the probability that there exists a chain ui,U2, ... Uq for which 
Ui contains e is at most e. 

Proof. There exists an upper bound z — z{l) for the number of paths with length 
at most I intersecting a given path of length at most I. Hence, there are at most 
z'^ sequences of paths mi, M2, •■• Uq for which e is in ui and Vj G {1, 2, ... q — 1}, ut 
intersects Uj+i. All such sequences contain \q/V\ paths of the same length. The 
ordering of their labels the \q/r\ ! permutations with the same probability, so 
the probabhlty that the labels are decreasing is l/\q/V\ !. This event is necessary 
for the sequence to be a chain. Denote the number of chains in the lemma by 
the random variable X (with respect to the random labelling). We have 



P{X > 1) < E{X) < -p^Yf -^ where g ^ oo, 
T 



which proves the lemma for some large enough number q. D 

Proof of Theorem [H Consider the variant of the Edmonds-Karp algorithm 
where we augment on the one of the shortest augmenting paths with the lowest 
label, and we stop the algorithm when no augmenting path with length at most 
I remains. In other words, we start from the empty flow, we take the paths u 
with length at most I in the increasing order of |m| + x{u), and with each path, 
we increase the actual flow / by cap{f, u) ■ u. We denote this algorithm by Ai 
and the resulting flow by /i = fi{G,x). 

Consider now the variant of the previous algorithm where we skip augment- 
ing on each path which can be obtained as the first element of any chain 



with length s. We denote this algorithm by A2 and the resulting flow by 
/2 = f2{G,x). The next lemma shows that /2(e) is a constant-time algorithm. 

Lemma 6. For each edge e and labelling x, f2{G,x){e) — f2{hsi{e), 
{x\V{h,iie)me). 

Proof. Let us consider the run of the two algorithms in parallel so that when 
the first one takes a path w in G then if u is in hsi{e) then the second algorithm 
takes u as well, otherwise it does nothing. If at a point, the two flows differ at an 
edge e' G E{hsi{e)) then there must have been a path u through e' on which the 
two algorithms augmented by different values. There are three possible reasons 
of it: 

1. M is not in hsi{e)] 

2. u can be obtained as the first term of some chain in G with length s, but 
not in hsi{e)] 

3. u has an edge e" in hsi{e) at which the values of the two flows were different 
before taking u. 

Assume that at the end, the two flows are different on e. Using the previous 
observation initially with e' = e, let us take a path u through e' on which 
the two augmentations were different, and consider which of the three reasons 
occurred. As long as the third one, repeat the step with choosing e' as the e" 
of the previous step. Since by each step we jump to an earlier point of the 
runs, we must get another reason sooner or later. Denote the paths considered 
during the process by ui, M2, ... Ut- (Note that these are in reverse order on the 
augmenting timeline.) 

Consider the case when the reason for ut was the first reason. The set of 
all edges of all of these t paths is connected, it contains at most tl edges, it 
contains e and an edge at least si away from e, so tl > sZ, whence t > s. Thus 
ui,U2t-- Us is a chain with a connected edge set with size at most si, so this 
chain is in hsi{e), that is why neither runs should have been augmented on ui, 
contradicting with the definition of ui. 

On the other hand, if the reason for Uj was the second reason then by ap- 
pending ui,U2, ... Ut with the chain from Uj with length s, as its subchain, we 
get a chain starting with ui with length s, and it provides the same contradic- 
tion, n 

We prove that if /2 is the output of A2 with I — 2dM/e and using the 
function q of LemmaJH with s — q{l, e/(4(iM)) satisfies the following inequality. 

Ei\f2\)>Ei\h\)~'-n>\r\^sn (1) 

/i contains no augmenting path with length at most I, so using Lemma |31 

dM dM e 

—" = 1/ I-hm'^^I/ 1-3' 



l/il > in - —n = ir I - ^^n = in - -n. 



Consider now the first inequality. 



Lemma 7. If /i (x) (e) ^ /2 (x) (e) then there exists a path through e which is 
the first term of a chain with length s. 

Proof. Let us consider the run of Ai and A2 in parallel so that at the same 
time these take the same edge. If at a point of the runs, the two flows differ 
in an edge e' then there must have been a path u through e' on which the two 
algorithms augmented by different values. There are two possible reasons of it: 

1. u is the first term of a chain with length s; 

2. u has an edge e" on which the values of the two flows were different before 
taking u. 

Assume that at the end, the two flows are different at e. Using the previous 
observation, let us take a path u through e' on which the two augmentation 
were different, and consider which of the two reasons occurred. As long as the 
latter one, repeat the step with choosing e' as the e" of the previous step. Since 
by each step we jump to an earlier point of the runs, we must get the first 
reason in finite many steps. Denote the paths considered during the process by 
ui,U2t-. Uf. Then appending ui,U2,--- ut with the chain from ut with length 
s, as its subchain, we get a chain starting with ui with length s. D 

If fi{x){e) ^ f2{x){e) then by Lemma [3 there exists a chain with length 
q{l, j^), and Lemma[5]says that this has probability at most j^- But even 
if this occurs, fi{x) - f2{x) < M - (-Af) = 2M. That is why, 

Ei\fi\) E{\f2\) = Ei\f,{x)\ - \f2{x)\) = EQif^ix) - f2ix)\) 
^E{ J2 h{x){e)-f2{x){e))< J^ -^ ' ^^^ ^ 

s^S; e£out(s) s^S; e^out{s) 

< dn ■ -^— ■ 2M < -n. 
AdM - 2 

We are finished proving ([Ij. 

Now, let /2(e) = E{f2{x){e)). It is a flow because it is easy to check that 
it satisfies all requirements, and I/2I — E{\f2\) > |/*| — £?^. Furthermore, /2(e) 
depends only on hsi{e), so it can be calculated by a constant-time algorithm. 
Consequently, this assignment satisfies the requirements of the theorem. D 

Proof of Theorem O Let /2 be the flow constructed by the constant-time al- 
gorithm of the previous proof with error bound e/2, which therefore satisfies 
I/2I G [|/*| — |, |/*|], and let r be the radius used there plus 1. Using the notion 
7(6) = {1 if b is true, if false} for an event 6, let 

1 ^ 

g{hrM,hr{v2),...hr{vk)) = -Y,{I{v^&S) ^ /2(e)). (2) 

i—1 e^out{vi) 



As Iivi)J2eeout(v)hi^) S [0,dM], the Law of Large Numbers says that (O 
stochasticahy uniformly (with respect to G) converges to the following. 

^ E (^(-e^) E ^2('^))-^E E /2('=o)-i/2ie[iri-fjri]- 

■uGV(G) eeout(t)) seSeeout(t)) 

This implies that, for a large enough fc, these k, r and 5 satisfy the requirements. 

D 
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